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1.  Technical  Summary  of  the  Project 

Two  challenges  confront  efforts  to  develop  effective  intelligent  systems  to  work  in  complex 
domains:  making  them  easier,  faster  and  less  expensive  to  develop,  and  making  them  easier 
to  comprehend,  use,  and  trust  (Haynes,  Councill,  &  Ritter,  2004).  This  applies  to  cognitive 
models  as  well  (Ritter,  2004).  We  have  created  a  high-level  behavior  representation 
language  and  integrated  development  environment  (IDE)  called  Herbal  (Cohen,  Ritter,  & 
Haynes,  2005)  that  answers  these  challenges.  Herbal  provides  explanations,  it  works,  has 
been  used  by  60  people  as  the  basis  of  a  course  on  modeling  and  as  a  tutorial  at  the  BRIMS 
conference,  and  the  data  so  far  suggest  a  3  to  7  x  performance  increase  compared  to 
developing  Soar  code  directly.  The  next  steps  are  to  evaluate  and  extend  Herbal,  and  to 
apply  Herbal  to  a  real-world  problem. 

This  proposal  makes  intelligent  systems  more  comprehensible  to  developers,  users,  and 
other  stakeholders  through  the  development  and  evaluation  of  the  high-level  behavior 
representation  language,  Herbal,  and  related  tools.  These  systems  are  designed  from  the 
ground  up  based  on  user  studies  and  explanation  theory  to  account  for  the  complexities  of 
human-computer  interaction  with  intelligent  systems,  both  in  development  and  in  use.  Two 
key  design  features  of  the  approach  are  the  ability  to  capture  explicit  design  rationale  from 
development  activities,  and  then  to  use  this  design  knowledge  as  the  basis  for  explanations 
of  the  resulting  system’s  structure,  behaviors,  and  purpose. 

Designing,  implementing,  and  using  intelligent  systems  can  be  difficult  tasks  considering 
that  the  background  and  expertise  of  developers  can  vary  from  novice  to  expert  (Ritter  et  al., 
2003).  Part  of  the  problem  stems  from  the  fact  that  cognitive  architectures  most  often  use 
low-level  programming  languages  to  model  behavior.  Both  Soar  (Laird,  Newell,  & 
Rosenbloom,  1987)  and  ACT-R  (Anderson  et  al.,  2004)  for  example,  use  production  rules  as 
their  primary  programming  construct.  A  wider  variety  of  users  can  be  supported  in  the  task 
of  developing  intelligent  systems  using  a  language  that  maps  more  directly  to  the  user’s 
domain.  For  example,  if  this  high  level  language  is  built  on  top  of  Soar,  and  closely 
resembles  the  problem  space  theory  used  by  the  Soar  architecture  (Newell,  1990),  it  would 
make  Soar  easier  to  use,  while  maintaining  the  features  of  Soar  that  make  it  interesting, 
including  learning,  interaction,  and  reactivity. 

Herbal  provides  explanations 

Herbal  provides  an  integrated  development  environment  for  users  to  create  models  in  Soar. 
The  resulting  model  is  compiled  into  Soar  productions.  The  IDE  and  an  associate  viewer  for 
use  at  runtime  (both  shown  in  Figure  1)  help  to  explain  the  structure  and  behavior  of 
cognitive  models. 
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Figure  1.  Herbal  interface  (left)  and  Herbal  Viewer  Interface  (right).  An  operator 
instance  pane  is  on  the  top  of  the  IDE.  The  Herbal  and  model  ontologies  are  visible  in 

the  far  left  pane  of  each  window. 


Herbal  provides  support  for  answering  core  model-related  questions:  What  questions  of 
models  (identity  of  objects,  definition,  relation,  and  events),  How  questions  (how  do  I  use 
the  model,  how  do  I  use  the  model’s  interface),  How  does  the  model  work.  Where  questions 
(where  did  the  event  happen),  Functional  explanations  (the  purpose  of  the  model),  and 
Pragmatic  explanations  (why  didn’t  something  else  happen?).  The  explanations  are 
available  in  graphical  and  textual  output. 

Herbal  works  now 

Herbal  has  been  taught  to  over  60  people,  35  for  a  semester  course  on  cognitive  modeling  at 
Penn  State  (1ST  402  Emerging  technologies:  Models  of  human  performance),  and  30  paying 
tutees  for  3  hours  at  BRIMS  (Ritter,  Morgan,  Stevenson,  &  Cohen,  2005).  This  usage  has 
improved  Herbal,  and  suggests  that  Herbal  has  all  the  functionality  Soar  provides  on  its  own, 
but  is  easier  to  use.  A  manual  supports  usability  (Cohen  &  Ritter,  2004),  and  to  improve  its 
usability  we  are  currently  building  a  library  of  models. 

Herbal’s  design  is  robust  and  reusable 

Herbal  uses  an  explicit  design.  User  needs  were  gathered  from  a  study  on  how  experts  and 
novices  interact  with  Soar  Tech’s  Situation  Awareness  Panel  (Avraamides  &  Ritter,  2002). 
The  study  results  were  reanalyzed  to  find  the  kinds  of  questions  Soar  users  ask  (Councill, 
Haynes,  &  Ritter,  2003;  Haynes,  Ritter,  Councill,  &  Cohen,  submitted  January  2005).  These 
results  are  highlighted  in  Figure  2.  The  users’  questions  were  used  along  with  several 
explanation  scenarios  to  generate  a  robust,  reusable,  responsibility-based  design  based  on  a 
compiler  (Haynes  et  al.,  2004).  This  explanation  approach  is  based  on  the  structure  of  the 
model  as  well  as  information  gathered  from  the  modeler  as  they  create  the  model  (design 
rationale)  (Haynes,  2001). 
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What  Users  Want 

(based  on  4  expert  SAP  users,  tu  Councill  et  al.,  2003) 
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Figure  2.  Summary  of  the  usability  study  on  what  users  want  (Councill  et  al.,  2003). 
The  circles  indicate  major  categories  of  questions. 


Herbal’s  IDE  is  realized  as  an  extension  to  popular  ontology  editor,  Protege,  and  an  existing 
ontology  language,  RDF.  The  basic  ontology,  the  objects  models  are  created  from,  is  shown 
in  Figure  3.  This  use  of  standard  tools  makes  it  easier  for  new  modelers  to  adopt  the 
development  environment,  and  means  that  cognitive  models  are  represented  in  a  portable 
data  format.  A  model  is  compiled  into  Soar  rules  using  an  XSLT  script.  The  explanations 
are  provided  graphically  in  the  IDE  as  well  as  in  a  tracing  tool.  Many  explanations  are 
graphical,  but  we  are  also  generating  textual  explanations  concurrent  with  the  model 
running. 

These  results  (data  and  design)  are  reusable  by  all  intelligent  agents  that  want  to  create 
explanations — we  have  reported  what  needs  to  be  done  to  support  explanations  (Haynes  et 
al.,  submitted  January  2005). 

El  ©HerbalClassA 
©  Model 
B  ©PSCMA 
0  ©Slate 

©  TopState 
0  ©Production 
©  Operator 
©  Elaboration 
©  Impasse 
0  ©  Expression  A 

pm 

©  Action 
©WMObject 

Figure  3.  Herbal  Ontology  Class  Hierarchy,  taken  from  its  interface. 
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Herbal  appears  easy  to  use 

Herbal  can  be  taught  within  a  semester  course.  While  we  did  not  perform  formal  studies, 
anecdotally,  students  taking  this  course  found  Soar  easier  to  use  than  previous  classes  have. 
When  an  undergraduate  created  a  medium-sized  dTank  model,  he  was  writing  code 
(measured  as  productions)  as  quickly  as  a  CMU  graduate  student  using  the  TAQL  Soar 
compiler,  which  itself  was  3x  normal  Soar  speed  (Yost,  1993).  This  result  is  documented  in 
a  recent  conference  paper  (Morgan,  Cohen,  Haynes,  &  Ritter,  2005).  (The  UG  is  now 
working  at  Soar  Tech  full-time).  Figure  4  shows  the  time  per  rule  (up  to  about  30  rules), 
and  the  cumulative  average  time  per  rule.  By  the  end  of  the  model  the  time  per  rule  is 
approximately  3  minutes  per  rule.  This  is  the  time  that  Yost  reported  for  himself  and  other 
CMU  graduate  students  using  TAQL. 


Production  Pair  (Propose  +  Apply) 


Figure  4.  Time  per  rule  (solid)  and  cumulate  average  time  per  rule  (dashed). 

Herbal  has  a  code  expansion  ratio  of  approximately  8  to  1  on  a  simple  system  (blocks 
world).  This  number  would,  of  course,  change  with  different  systems.  The  expansion  ratio 
is  fair  because  comments  represent  either  explanation  material  currently  used,  or  explanation 
material  to  be  used.  Brooks  (of  the  Mythical  man-month  book,  1975)  would  argue  that  this 
will  lead  to  a  8x  speedup  writing  code  because  programmers  he  studied  appear  to  program  a 
set  amount  of  code  per  day  across  languages,  thus  higher  level  languages  are  more 
productive.  Here,  however,  some  of  the  8x  is  providing  explanation,  not  just  decreasing 
programming  time. 

The  results  of  the  G2A  compiler  (GOMS  to  ACT-R)  system  is  consistent  with  the  Herbal 
results.  G2A  was  also  created  as  part  of  this  project  to  explore  high-level  behavior 
compilers.  G2A  provides  a  7.5x  code  expansion.  It  allowed  a  developer  to  create  ACT-R 
models  in  30  min.  that  took  a  good  masters  student  2  weeks  (~  60  hours)  to  write  in  ACT-R. 
The  speed-up  reported  at  a  recent  ONR  workshop  of  up  to  30x  is  thus  conservative — the 
speedup  is  probably  more'like  lOOx.  This  speedup  is  documented  in  a  journal  article  (St. 
Amant,  Freed,  &  Ritter,  2005),  based  on  a  conference  paper  that  won  best  applied  paper  at 
the  ICCM  conference  (St.  Amant  &  Ritter,  2004). 
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There  is  increasing  interest  in  high  level  modeling  languages,  for  example,  Visual  Soar, 
SoarDoc,  ViSoar,  CAST,  and  HLSR,  In  comparison  to  other  projects  in  this  area,  John  et 
al’s  work  creates  models,  but  not  as  fast  as  G2A,  and  not  as  generally  as  Herbal.  The 
Pearson/Laird  system  does  not  provide  as  general  a  tool  as  Herbal.  HLSR  and  the 
Pearson/Laird  tool  are  not  implemented  to  the  same  level  as  Herbal,  and  certainly  not  tested 
to  the  same  degree  as  Herbal,  in  terms  of  general  usability,  programming  speed,  and  code 
expansion.  We  were  struck  by  how  many  things  Herbal  does  now  that  others  propose  to  do. 
Herbal  generates  the  types  of  explanations  users  want,  and  a  mapping  from  question  type  to 
answer  was  provided  in  the  talk  slides  presented  in  June  2005. 

Herbal  has  been  praised  publicly  by  several  people.  Chris  Glur  on  the  Soar  mailing  list  in 
June  2005  noted  “After  the  first  few  hours  spent  looking  into  SOAR,  I  wondered  why  one 
was  forced  to  grovel  down  at  low  level.  And  I’ve  been  waiting  decade(s)  for  what  Herbal 
seems  to  describe.”  Brian  Haugh  of  IDA  noted  at  the  BRIMS ’05  tutorial  that  “. .  .it  deserves 
more  funding.”  During  and  after  the  BRIMS  tutorial  we  received  8  enquiries. 

dTank  makes  it  easy  to  test  models 

Part  of  Herbal’s  success  is  because  we  can  create  and  test  interesting  Herbal  models  quickly 
using  dTank  (acs.ist.psu.edu/dTank).  dTank  is  a  Java-based  game  explicitly  developed  for 
testing  explanation  of  dynamic,  adversarial  models  (Morgan,  Ritter,  Stevenson,  Schenck,  & 
Cohen,  2005).  It  provides  a  simplified  version  of  the  large  synthetic  environments  like 
ModSAF  for  testing  ideas — It  is  a  ModSAF-like  sandbox. 

dTank  appears  to  be  100  to  l,000x  faster  to  use  (setup  a  model  connection,  add  a  model,  run 
a  model)  than  a  full  synthetic  environment.  These  numbers  are  based  on  having  junior  and 
senior  undergraduate  students  and  graduate  students  at  Penn  State  and  Lock  Haven  use 
dTank  within  5  minutes.  dTank  is  not  a  replacement  for  systems  like  ModSAF,  jSAF,  and 
OTB.  But  when  their  size  and  complexity  are  not  needed,  dTank  offers  a  usefully  sized, 
ready-built  alternative  with  some  surface  validity. 

dTank  allows  multiple  players  and  teams  on  multiple  machines,  and  works  where  Java 
works.  Recently  it  has  been  made  more  robust.  The  correspondence  between  agents  and 
human  vision  has  been  improved;  its  speed  has  been  improved;  its  interface  has  been  made 
easier  to  use;  and  we  have  now  four  architectures  use  it  (CAST,  Soar,  Jess,  and  Java).  We 
have  examined  how  to  include  additional  architectures  like  ACT-R,  JACK,  and  COJACK. 
Our  preliminary  work  suggests  that  these  should  be  straightforward. 

dTank  has  been  used  by  an  Army  MURI  (Sun,  Councill,  Fan,  Ritter,  &  Yen,  2004)  to 
compare  teamwork  in  different  architectures.  It  has  been  used  at  Lock  Haven  U.  (www 
.lhup.edu/~mcohen/  dTank/dTankJess.htm,  Cohen,  2005)  and  at  the  Federal  U.  of 
Uberlandia  (Brazil)  to  teach  agent  programming.  At  least  two  other  universities  and  one 
government  lab  and  one  private  contactor  have  made  enquiries  about  using  it.  We  have  an 
informal  agreement  that  a  version  will  be  included  in  the  ACT-R  distribution  in  the  future. 
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SegMan  offers  a  way  for  models  to  interact  with  interfaces 

Models  need  to  be  able  to  interact  with  task  environments.  As  a  portion  of  our  project  Ritter 
had  several  students  work  with  other  resources  to  explore  this  problem.  Sim-eyes  and  - 
hands  are  a  way  for  models  to  interact  directly  with  interfaces.  We  continued  to  work  on 
apply  SegMan  (St.  Amant  &  Riedl,  2001)  to  support  this  approach.  SegMan  is  a  tool 
developed  by  Robert  St.  Amant.  It  parses  screen  bitmaps  and  generates  operating  system 
function  calls  to  allow  models  and  agents  to  interact  with  interfaces. 

Figure  5  shows  some  of  the  interfaces  that  we  have  used  SegMan  to  interact  with.  In  these 
cases  the  interfaces  were  not  modified  to  support  the  interaction.  Thus,  the  first  telephone 
interface,  while  written  in  Tcl/Tk,  did  not  have  to  be  modified  to  be  interacted  with  (St. 
Amant  et  al.,  2005).  The  driving  game  was  written  in  Java,  and  taken  from  the  net.  We 
used  SegMan  to  predict  driving  performance,  and  we  did  this  without  modifying  the 
interface  (Ritter,  Van  Rooy,  St.  Amant,  &  Simpson,  in  press).  The  casino  interface  was,  in 
fact,  presented  as  a  bitmap  to  the  user  through  a  browser,  and  was  designed  to  not  be  usable 
by  non-human  agents.  A  graduate  student  in  a  weekend  was  able  to  create  a  model,  install 
SegMan,  and  test  a  theory  about  gambling  strategies.  The  cell  phone  is  a  bitmap  taken  from 
the  Internet.  We  created  a  model  to  test  how  to  optimize  its  design  (St.  Amant,  Horton,  & 
Ritter,  accepted  2005).  The  robot  interface  was  not  instrumen table.  A  master’s  student  used 
an  ACT-R  model  to  predict  human  performance  on  the  interface  (Kukreja,  2004). 


Figure  5.  Interfaces  that  models  interact  with  using  SegMan. 
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This  approach  lets  models  be  applied  to  more  interfaces,  and  while  not  perfect  yet,  it  appears 
to  do  so  faster  than  previous  approaches  of  instrumenting  interfaces.  We  estimate  that  it  can 
interact  with  new  cell  phone  interfaces,  for  example,  20  times  faster  than  instrumenting 
them. 

Results  from  the  G2A  supports  the  role  of  compilers  in  modeling 

We  also  worked  on  a  compiler  for  ACT-R.  this  project  was  also  done  with  Robert  St.  Amant.  This 
compiler,  GOMS  to  ACT-R  (G2A),  was  primarily  done  by  St.  Amant  for  other  purposes.  Our  paper 
was  awarded  the  Siegel-Wolf  Award  for  best  applied  modeling  paper  at  the  2004  International 
Conference  on  Cognitive  Modeling  (St.  Amant  &  Ritter,  2004),  and  was  subsequently  published  as  a 
journal  article  (St.  Amant  et  al.,  2005). 

G2A  appears  to  allow  users  to  generate  code  about  20x  faster  than  plain  ACT-R.  This  measure  is 
based  on  two  people  creating  models  to  use  cell  phones  (including  some  of  the  ones  shown  in  Figure 
5).  It  took  about  2  weeks  for  a  masters  student  who  knew  ACT-R  to  make  an  ACT-R  model,  and 
about  two  hours  using  G2A.  (Thus,  the  ratio  is  something  like  40  hours  to  2  hours,  but  related 
changes  will  remain  much  faster.)  G2A  code  expands  at  about  a  factor  of  7.5  to  1,  which  suggests 
that  20x  might  be  a  somewhat  high  prediction. 

Work  with  Vista  illustrates  the  role  of  displays 

We  put  some  effort  into  exploring  how  to  use  Vista  to  help  create  displays  for  Herbal.  Figure  6  shows 
a  display  we  created  to  show  sequential  behavior  (Daughtry  &  Ritter,  2005).  It  has  been  used  to 
explain  models  in  Herbal  in  reports  (Morgan,  Ritter  et  al.,  2005),  for  example,  to  illustrate  strategies. 
Figure  6  shows  how  a  tank  in  dTank  basically  waits  until  something  is  visible.  Then  some  cognition 
occurs  in  preparation  for  attack,  then  another  precept  indicates  that  it  is  time  to  attack. 


Figure  6.  Vista  display  of  a  Herbal  agent  playing  dTank. 


Summary 

The  project  that  developed  Herbal  and  dTank  has  created  a  reusable  design  to  create  high- 
level  behavior  representation  languages  that  can  provide  explanations  at  the  model  level. 
The  design  is  based  on  user  studies  and  surveys  of  the  literature.  The  resulting  models  are 
organized  graphically  in  an  ontology  editor,  and  should  be  easier  to  comprehend  and  reuse 
because  of  this  standard,  portable  representation  (but  this  reusability  needs  to  be  tested 
further). 
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The  resulting  system,  Herbal,  provides  now  nearly  all  the  explanation  types  that  users 
request,  but  the  usability  of  these  explanations  still  need  to  be  improved. 

Herbal  has  been  initially  tested  with  over  60  users.  These  tests  have  led  to  some  successes 
as  noted  in  its  use  by  undergraduate  students,  graduate  students,  and  colleagues  (Cohen, 
2005;  Cohen,  Ritter,  &  Haynes,  2005;  Morgan,  Cohen  et  al.,  2005).  Herbal  will  need  to  be 
tested  with  more  advanced  users  creating  larger  systems. 

Herbal’s  design  appears  to  be  quite  sound,  and  it  should  now  be  applied  at  this  point  by 
modifying  Herbal  to  compile  into  additional  base  languages  besides  Soar,  such  as  JESS  and 
ACT-R.  It  should  also  be  applied  by  creating  models  that  use  the  metacognitive  facilities 
Herbal  can  include  in  a  model  to  create  an  intelligent  opponent  in  dTank. 

dTank  works,  but  it  needs  to  be  made  even  easier  to  use  as  people  find  more  uses  for  it,  and 
want  to  run  larger  simulations  to  test  models,  and  to  run  these  tests  more  routinely. 

These  open  tasks  are  presented  in  expanded  form  in  the  next  section. 

2.  Future  Work  with  Herbal 

Here  we  describe  the  new  tasks  suggested  by  the  technical  basis  above. 

Taskl:  Extending  Herbal 

Herbal  needs  to  be  extended.  Herbal  has  been  used  to  create  modest  sized  systems  ranging 
from  10  to  100  rules.  The  basic  technology,  Protege,  has  been  used  to  create  ontologies  with 
up  to  100,000  objects,  so  we  believe  that  Herbal  can  scale  up,  but  this  remains  an  open, 
empirical  question.  One  has  to  imagine,  however,  that  400  operators  in  Herbal  would  be  far 
easier  to  write  and  manipulate  than  1,200  unstructured  productions,  but  this  improved 
usability  remains  to  be  demonstrated  for  larger  models.  Smaller  models  appear  to  be  at  least 
3x  faster  to  write.  We  are  currently  writing  some  models  to  test  how  Herbal  scales,  but  this 
remains  an  area  to  test  and  extend  Herbal,  and  resources  are  needed  to  demonstrate  a  scaling. 

There  are  also  several  areas  where  our  current  users  have  pointed  out  where  Herbal  can  be 
expanded  and  improved.  These  include  the  ability  to  do  some  automatic  model  checking  and 
the  ability  to  make  the  details  easier  to  enter  and  write.  We  also  will  use  the  design  rational 
in  Herbal  to  provide  concurrent  audio  output  from  the  model.  This  output  will  provide  an 
additional  explanation  modality 

Programming  in  Herbal  language  involves  instantiating  objects  using  these  ontological 
classes.  The  process  of  programming  a  model  is  reduced  to  instantiating  objects  from  a  set 
of  classes,  instead  of  coding  structure  implicitly  in  and  across  a  large  set  of  heterogeneous 
Soar  productions.  When  objects  are  instantiated,  the  ontology  includes  additional  attributes 
that  allow  the  developer  to  describe  the  intent  of  the  object.  This  descriptive  information  is 
automatically  embedded  in  the  generated  Soar  code  as  comments  when  the  model  is 
compiled,  and  will  be  used  in  future  version  of  Herbal  to  generate  concurrent  and  post-task 
explanations.  The  explanation  display  (the  Herbal  Viewer)  also  includes  a  listing  of  the 
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model  operators,  and  tools  to  explore  and  manipulate  them,  and  their  substructures  in  a 
uniform  way. 

The  Herbal  Viewer  is  a  tool  designed  to  provide  a  portal  into  an  intelligent  agent  developed 
using  the  Herbal  IDE.  The  Viewer  currently  includes  basic  explanation  capabilities  that  will 
be  significantly  expanded  as  part  of  the  work  proposed  here.  In  particular,  we  propose  to  use 
results  from  the  various  analytic  and  empirical  evaluation  studies  discussed  below  as  part  of 
a  feedback  loop  driving  further  development  of  the  Herbal  tools. 

These  research  activities  will  result  in  a  number  of  products  for  dissemination  including  the 
Herbal  language  specification,  the  Herbal  language  implementation  as  an  open-source 
Protege  plug-in,  and  the  Herbal  Viewer  as  an  open-source  Java  application.  In  addition,  we 
have  published  papers  on  a  number  of  topics  related  to  the  design  and  development 
component  of  the  project.  These  present  a  reusable  design  and  the  lessons  learned.  We  have 
prepared  a  paper  on  design  patterns  for  explanation  facilities  in  intelligent  systems, 
explanation  filtering  and  tailoring  techniques,  and  explanation  visualization. 

Task  2:  Testing  Herbal 

Herbal  further  testing  and  augmentation.  We  will  be  testing  Herbal  in  two  ways.  We  will 
run  a  more  formal  user  study  on  how  fast  users  can  create  models  in  Herbal.  Second,  we 
will  modify  Herbal  to  compile  into  another  AI  language,  such  as  Sandia’s  Java  Expert 
System  Shell  (JESS,  Friedman-Hill,  2003). 

Task  3:  Applying  Herbal 

Herbal  needs  to  be  applied  to  new  problems  and  new  domains.  This  will  provide  further 
lessons  for  the  language,  but  it  will  also  harvest  and  apply  the  results  so  far.  We  will 
integrate  Herbal  agents  into  a  large-scale  system  designed  to  enhance  defensive  anti¬ 
terrorism  planning  and  resource  allocation.  We  will  develop  a  Herbal  agent  in  the  USMC 
anti-terrorism  resource  allocation  decision  model  (Haynes,  Kannampallil,  Larson,  &  Garg, 
2005)  as  an  example  use  of  Herbal  in  systems  applicable  to  military,  government,  and 
industry  contexts. 

Task  4:  Updating  dTank 

Herbal’s  associated  sandbox  needs  to  be  upgraded.  We  have  developed  and  used  the  dTank 
microworld  (Morgan,  Ritter  et  al.,  2005)  to  illustrate  Herbal’s  explanation  capabilities.  It  is 
from  lOOx  to  l,000x  easier  to  use  than  JSAF  and  related  large  simulations,  so  it  is  useful  for 
testing  models.  It  is  so  easy  to  hook  models  up  to  it  that  undergraduates  have  used  it 
routinely  in  class. 

dTank  has  also  been  used  by  Java,  Soar,  Soar/Slip,  and  CAST  agents  (Ritter,  Sun,  Schenck, 
Gross,  &  Stevenson,  in  preparation).  Two  MoD  projects  and  an  DMSO  project  both  plan  to 
use  it,  and  it  has  been  used  by  an  Army  MURI  (Sun  et  al.,  2004).  ACT-R  has  agreed,  in 
principle,  to  include  a  version  in  their  next  ACT-R  release.  It  has  the  promise  of  being  a 
successful  and  widely  used  piece  of  software  to  support  work  in  synthetic  environments. 
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Task  5:  Creating  intelligent  opponents  with  Herbal 

Herbal  can  be  used  to  create  more  intelligent  opponents.  Herbal  provides  a  declarative 
representation  of  the  knowledge  in  the  model.  We  can  create  a  model  that  reasons  about  its 
behavior  and  modifies  its  behavior  and  learns  using  meta-cognitive  methods  because  this 
representation  knowledge  is  available.  This  will  provide  a  better  model  of  learning,  and  also 
a  more  intelligent  opponent.  It  will  also  stretch  and  demonstrate  our  explanation 
capabilities,  as  this  will  produce  a  model  that  has  something  to  explain,  and  it  will  be  a 
model  that  people  can  learn  from. 

The  exploratory  version  of  this  model  will  be  created  in  dTank.  This  approach  makes  the 
process  more  tractable,  and  allows  much  more  rapid  exploration. 

Summary 

Our  multidisciplinary  research  program  has  helped  introduce  a  new  concept  to  cognitive 
architectures,  that  of  a  high-level  language  and  compiler.  The  compiler  is  based  on  a  theory 
and  empirically  derived  design.  The  resulting  compiler,  Herbal,  has  been  tested  in  a  variety 
of  ways.  Feedback  from  its  use  and  from  an  ONR  program  review  workshop  has  lead  to 
suggestions  for  extensions  and  further  tests. 

Our  design  supports  applying  the  results  quite  widely  by  putting  them  into  existing  pieces  of 
software  with  large  user  communities.  Our  core  team  of  investigators  has  shown  that  they 
are  willing  and  able  to  combine  their  separate  research  programs  to  create  a  new  approach  to 
cognitive  modeling,  one  that  applies  results  from  software  engineering  and  empirical  tests  of 
software  to  the  area  of  cognitive  modeling. 

Our  team  now  understands  how  to  design,  create,  test,  and  distribute  a  high  level  language 
for  modeling.  Our  team  has  worked  extensively  together  to  create  Herbal  and  dTank,  and 
we  are  able  to  rapidly  include  changes  to  the  system. 

This  project  hastens  the  transition  of  basic  research  findings  to  practical  application.  The  use 
of  a  high-level  compiler  for  cognitive  architectures  can  be  and  has  been  used  immediately  by 
others,  and  as  we  propose  here,  could  also  be  applied  to  ACT-R  models  in  synthetic 
environments.  The  tasks  proposed  here  will  provide  several  applications. 

3.  Relevance  to  the  Navy  of  These  Results 

We  note  payoffs  for  the  Navy  here,  as  well  as  several  accelerators  of  these  payoffs.  At  an 
abstract  level,  the  relevance  of  the  research  and  technology  development  proposed  here 
relates  to  any  situation  involving  design  of  complex  systems  and  subsequent  exploration, 
explanation  about  these  designs  including  their  intention,  the  rationale  behind  their  form, 
and  the  constraints  that  dictate  what  is  possible  in  today’s  engineering,  economic,  and  socio¬ 
political  context.  These  results  thus,  with  further  development  will  apply  to  a  wide  range  of 
systems. 
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Creating  models  more  quickly,  more  accurately,  more  affordably  to  help 
with  simulation  and  training 

Models  created  in  Herbal  can  be  used  in  place  of  humans  in  simulation  and  training.  Herbal 
appears  to  make  models  3x  to  7x  faster  to  create.  Making  models  easier  to  create  will  not 
just  save  time;  ease  of  use  should  also  lead  to  creating  more  complete  and  more  accurate 
colleagues  and  opponents  to  support  training  activities.  Making  models  easier  to  use  for 
design  and  for  synthetic  environments  is  important  for  the  Navy. 

More  understandable  agents  and  opponents 

Models  created  in  Herbal  or  in  tools  that  use  Herbal’s  design  will  include  more  explanatory 
information.  This  will  assist  in  their  debugging,  in  their  verification  and  validation,  and  in 
their  use  in  synthetic  environments.  It  also  means  that  users  and  other  project  stakeholders 
are  provided  with  a  much  richer  depth  of  knowledge  about  the  tools  they  use  and  the 
domains  in  which  they  work.  Technology  designs  embed  and  inscribe  significant  knowledge 
above  and  beyond  what  is  apparent  to  a  user  of  a  given  technology,  especially  software. 
Among  the  foundations  of  the  work  proposed  here  is  exposing  this  knowledge  so  that  users 
are  able  to  draw  on  the  expertise  marshaled  in  support  of  technology  design. 

Application  to  timely  problem  of  anti-terrorism  force  protection 

Anti-terrorism  planning  is  currently  among  the  most  important  and  high  profile  domains  in 
systems  research.  Since  the  events  of  9/1 1  and  subsequent  analysis  of  the  lessons  learned 
from  those  events,  it  is  broadly  recognized  that  ensuring  Force  Protection  and  Homeland 
Security  requires  not  only  better  access  to  information,  but  more  effective  use  of  the 
information  that  is  already  available  to  planners.  Enormous  quantities  of  information  are 
generated  by  various  forms  of  human,  signals,  electronic,  and  image  intelligence  gathering 
mechanisms,  but  human  and  group  cognitive  limitations  severely  constrain  whether  and  how 
this  information  is  comprehended. 

Anti-terrorism  planning  involves  identifying  and  prioritizing  assets  to  be  protected, 
determining  the  relative  utility  of  different  mitigations  that  can  be  applied  to  protect 
prioritized  assets,  and  then  selecting  the  portfolio  of  mitigation  projects  that  provides  the 
overall  greatest  net  benefit  given  resource  constraints.  Such  planning  is  rarely  performed  in  a 
vacuum,  it  typically  involves  coordination  among  a  number  of  key  roles.  Anti -terrorism 
planning  is  new  to  the  job  description  for  many  of  those  currently  working  in  this  area.  This 
makes  the  usability  of  software  tools  designed  to  support  anti -terrorism  planning  activities  a 
key  requirement  for  these  systems. 

The  purpose  of  the  research  proposed  here  is  to  better  understand  the  usability  of  anti¬ 
terrorism/force  protection  planning  systems  and  to  contribute  to  their  usability  through  the 
addition  of  intelligent  agents  to  augment  human  cognition. 

More  intelligent  opponents 

The  creation  of  an  opponent  that  can  learn  through  reflection  using  a  domain  ontology  will 
provide  more  intelligent  opponents.  These  opponents  will  be  useful  in  training.  They  will 
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also  be  useful  in  analyses.  An  intelligent  opponent  that  learns  can  test  systems,  like  in  the 
Rampart  ATFP  system  (Haynes  et  al.,  2005),  to  a  higher  level  and  more  systematically. 

The  Rampart- Attacker  opponent  may  also  be  useful  as  a  tool  to  explore  how  cognitive 
models  can  act  as  repositories  of  knowledge  in  domains  such  as  terrorist  psychology  and 
tactics.  One  advantage  of  cognitive  modeling  architectures  such  as  Soar  is  that  models  can 
be  easily  evolved  in  response  to  new  research  findings  and  psychological  study  results.  In 
other  words,  as  our  research  progresses  we  expect  to  make  the  Rampart-Attacker  model 
increasingly  intelligent  and  representative  of  the  state-of-knowledge  in  terrorist  psychology. 
The  enhanced  explanation  capability  of  the  Herbal  development  environment  makes  this 
knowledge  more  transparent  to  developers  and  users  of  agents  created  within  the 
environment  thereby  increasing  the  utility  of  these  agents  as  learning  devices. 

An  environment  for  testing  modeling  and  behavior  representations 

One  reason  it  is  expensive  to  develop  models,  architectures,  and  algorithms  is  because  it  is 
expensive  to  tie  a  model  to  task  so  that  the  model  can  ‘do  its  stuff’.  The  cost  of  tying  models 
to  environments  has  been  noted  a  few  times,  but  it  is  an  under-stated  problem,  although  it 
has  been  noted  in  a  few  places  (Pew  &  Mavor,  1998;  Ritter,  Baxter,  Jones,  &  Young,  2000; 
Ritter  et  al.,  2003).  Many  projects  seem  to  be  spending  1/4  of  their  time  or  more  tying 
demonstration  models  to  large  environments  when  a  simple  test  environment  would  help 
answer  many  development  questions.  dTank  offers  a  useful  environment  for  testing  the 
effects  of  errors  and  their  implementation  in  an  architecture,  the  effects  of  moderators  on 
performance,  and  so  on.  It  appears  to  offer  approximately  a  l,000x  speed  up  for  this  task. 

Based  on  an  extensively  used  architecture 

Soar  is  an  important  cognitive  architecture.  It  has  been  widely  used  in  synthetic 
environments  (e.g.,  Jones  et  al.,  1999),  and  is  often  used  for  comparison  with  other 
architectures  (e.g.,  Gluck  &  Pew,  in  press;  Pew  &  Mavor,  1998;  Ritter,  Sun,  Schenck,  Gross, 
&  Stevenson,  submitted;  Shakir,  2002).  It  is  thus  doubly  useful  to  make  Soar  models  easier 
to  understand  and  to  create.  Soar  models  have  shown  that  they  are  useful  in  synthetic 
environments. 

Accelerators  of  these  payoffs 

There  are  several  accelerators  of  these  payoffs.  Technology  transfer  will  be  hastened  by  our 
close  ties  with  three  other  groups:  (a)  We  have  been  sharing  this  work  with  Colin  Sheppard, 
Dstl  (formerly.  Defense  Evaluation  and  Research  Agency,  UK)  as  the  UK  has  been  pursuing 
a  similar  research  program  to  make  models  easier  to  create.  They  have  been  exploring  the 
use  of  the  JACK  agent  architecture  to  create  a  more  cognitive  version  called  COJACK 
(Norling  &  Ritter,  2004).  In  addition  of  this  connection  as  an  outlet,  some  of  the  ideas  of 
JACK,  of  explicit  representation  of  plans,  has  given  us  ideas  for  developing  Herbal. 

(b)  Ritter  is  part  of  the  major  cognitive  modeling  communities,  Soar  and  ACT-R,  and  he  and 
the  ACS  Lab  maintain  the  Frequently  Asked  Questions  List  for  both  Soar  and  ACT-R.  He 
will  disseminate  these  results  through  presentations  at  the  Soar  Workshop,  the  ACT-R 
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workshop,  at  the  International  Cognitive  Modeling  Conference,  and  at  the  Cognitive  Science 
Conference. 

(c)  Haynes  has  worked  closely  with  both  the  United  States  Marine  Corps,  through  the 
Marine  Corps  Research  University,  and  the  Defense  Threat  Reduction  Agency  (DTRA)  on 
system  approaches  to  managing  the  terrorist  threat.  If  funded,  the  work  proposed  here  will  be 
informed  by  these  efforts  and  will  in  turn  inform  ongoing  work  with  these  Department  of 
Defense  organizations. 

In  addition  to  the  traditional  outputs  of  technical  reports,  conference  papers,  and  journal 
articles,  we  take  advantage  of  the  Internet  to  provide  additional  supporting  documentation. 
For  these  tools,  we  have  created  web  sites  in  order  to  encourage  reuse  of  the  theories  and 
models.  These  web  sites  have  included  the  Herbal  system,  its  manuals,  and  dTank  and  its 
documentation  and  supporting  materials.  They  are  available  as  acs.ist.psu.edu/Herbal  and 
acs.ist.psu.edu/dTank.  We  have  created  movies  illustrating  model  behavior  before,  and  they 
have  been  useful  for  explaining  the  model  when  the  model  is  not  available  or  is  difficult  to 
install.  As  well  as  supporting  classes  and  local  users,  these  websites  have  lead  to  at  least  one 
user  picking  up  dTank  and  using  it  from  a  distance. 
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